Estimating FST and kinship for arbitrary population structures
Abstract
FST and kinship are key parameters often estimated in modern population genetics studies in order to quantitatively characterize structure and relatedness. Kinship matrices have also become a fundamental quantity used in genome-wide association studies and heritability estimation. The most frequently used estimators of FST and kinship are method-of-moments estimators whose accuracies depend strongly on the existence of simple underlying forms of structure, such as the independent subpopulations model of non-overlapping, independently evolving subpopulations. However, modern data sets have revealed that these simple models of structure likely do not hold in many populations, including humans. In this work, we analyze the behavior of these estimators in the presence of arbitrarily complex population structures, which results in an improved estimation framework specifically designed for arbitrary population structures. After generalizing the definition of FST to arbitrary population structures and establishing a framework for assessing the bias and consistency of genome-wide estimators, we calculate the accuracy of existing FST and kinship estimators under arbitrary population structures, characterizing biases and estimation challenges unobserved under their originally assumed models of structure. We then present our new approach, which consistently estimates kinship and FST when the minimum kinship value in the dataset is estimated consistently. We illustrate our results using simulated genotypes from an admixture model, constructing a one-dimensional geographic scenario that departs nontrivially from the independent subpopulations model. Our simulations reveal the potential for severe biases in estimates of existing approaches that are overcome by our new framework. This work may significantly improve future analyses that rely on accurate kinship and FST estimates.
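To make the independent subpopulations setting concrete, the following is a minimal sketch of the classic textbook FST, computed as (HT - HS) / HT from per-subpopulation allele frequencies and pooled across loci as a ratio of sums. It is not the estimator proposed in this work. The function name `wright_fst`, the equal weighting of subpopulations, and the use of allele frequencies as if they were known exactly (no sampling-bias correction) are all assumptions made for illustration.

```python
def wright_fst(freqs):
    """Textbook FST = (HT - HS) / HT, pooled over loci as a ratio of sums.

    freqs: list of loci; each locus is a list with one allele frequency
    per subpopulation. Subpopulations are weighted equally (an assumption),
    and frequencies are treated as known, so no sampling correction is made.
    """
    numerator = 0.0
    denominator = 0.0
    for locus in freqs:
        k = len(locus)
        p_bar = sum(locus) / k
        # Expected heterozygosity of the pooled (total) population.
        h_total = 2.0 * p_bar * (1.0 - p_bar)
        # Mean expected heterozygosity within subpopulations.
        h_within = sum(2.0 * p * (1.0 - p) for p in locus) / k
        numerator += h_total - h_within
        denominator += h_total
    if denominator == 0.0:
        raise ValueError("all loci are fixed; FST is undefined")
    return numerator / denominator
```

For example, `wright_fst([[0.2, 0.8]])` gives approximately 0.36, and identical frequencies across subpopulations give 0. This calculation presupposes discrete, non-overlapping subpopulations, which is the kind of assumption the abstract says often fails in real data.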
Similar resources
Kinship and Population Subdivision
The coefficient of kinship between two diploid organisms describes their overall genetic similarity to each other relative to some base population. For example, kinship between parent and offspring of 1/4 describes gene sharing in excess of random sharing in a random mating population. In a subdivided population the statistic Fst describes gene sharing within subdivisions in the same way. Since...
Estimating kinship in admixed populations.
Genome-wide association studies (GWASs) are commonly used for the mapping of genetic loci that influence complex traits. A problem that is often encountered in both population-based and family-based GWASs is that of identifying cryptic relatedness and population stratification because it is well known that failure to appropriately account for both pedigree and population structure can lead to s...
Estimating and interpreting FST: the impact of rare variants.
In a pair of seminal papers, Sewall Wright and Gustave Malécot introduced FST as a measure of structure in natural populations. In the decades that followed, a number of papers provided differing definitions, estimation methods, and interpretations beyond Wright's. While this diversity in methods has enabled many studies in genetics, it has also introduced confusion regarding how to estimate FS...
Modified Sampling Strategies Using Correlation Coefficient for Estimating Population Mean
This paper proposes two sampling strategies based on the modified ratio estimator using the population mean of auxiliary variable and population correlation coefficient between the study variable and the auxiliary variable by Singh and Tailor (2003) for estimating the population mean (total) of the study variable in a finite population. A comparative study is made with usual sampling strategies...
Event Structures for Arbitrary Disruption
In process algebras that allow for some form of disruption, it is important to state when a process terminates. One option is to include a termination action √ . Another approach is that the ‘final’ executed action of a process terminates the process. The semantics of the former approach has been investigated in the literature in detail, e.g. by providing consistent true-concurrency and operati...
Journal
Journal title: PLOS Genetics
Year: 2021
ISSN: 1553-7404, 1553-7390
DOI: https://doi.org/10.1371/journal.pgen.1009241